Leveraging medical thesauri and physician feedback for improving medical literature retrieval for case queries

نویسندگان

  • Parikshit Sondhi
  • Jimeng Sun
  • ChengXiang Zhai
  • Robert Sorrentino
  • Martin S. Kohn
چکیده

OBJECTIVE This paper presents a study of methods for medical literature retrieval for case queries, in which the goal is to retrieve literature articles similar to a given patient case. In particular, it focuses on analyzing the performance of state-of-the-art general retrieval methods and improving them by the use of medical thesauri and physician feedback. MATERIALS AND METHODS The Kullback-Leibler divergence retrieval model with Dirichlet smoothing is used as the state-of-the-art general retrieval method. Pseudorelevance feedback and term weighing methods are proposed by leveraging MeSH and UMLS thesauri. Evaluation is performed on a test collection recently created for the ImageCLEF medical case retrieval challenge. RESULTS Experimental results show that a well-tuned state-of-the-art general retrieval model achieves a mean average precision of 0.2754, but the performance can be improved by over 40% to 0.3980, through the proposed methods. DISCUSSION The results over the ImageCLEF test collection, which is currently the best collection available for the task, are encouraging. There are, however, limitations due to small evaluation set size. The analysis shows that further refinement of the methods is necessary before they can be really useful in a clinical setting. CONCLUSION Medical case-based literature retrieval is a critical search application that presents a number of unique challenges. This analysis shows that the state-of-the-art general retrieval models are reasonably good for the task, but the performance can be significantly improved by developing new task-specific retrieval models that incorporate medical thesauri and physician feedback.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Medical Case-based Retrieval by Leveraging Medical Ontology and Physician Feedback: UIUC-IBM at ImageCLEF 2010

This paper reports the experiment results of the UIUC-IBM team in participating in the medical case retrieval task of ImageCLEF 2010. We experimented with multiple methods to leverage medical ontology and user (physician) feedback; both have worked very well, achieving the best retrieval performance among all the submissions.

متن کامل

Textual Methods for Medical Case Retrieval

Medical case retrieval (MCR) is information retrieval in a collection of medical case descriptions, where descriptions of patients’ symptoms are used as queries. We apply known text retrieval techniques based on query and document expansion to this problem, and combine them with new algorithms to match queries and documents with Medical Subject Headings (MeSH). We ran comprehensive experiments ...

متن کامل

Leveraging User Query Sessions to Improve Searching of Medical Literature

Published reports about searching medical literature do not refer to leveraging the query context, as expressed by previous queries in a session. We aimed to assess novel strategies for context-aware searching, hypothesizing that this would be better than baseline. Building upon methods using term frequency-inverse document frequency, we added extensions such as a function incorporating search ...

متن کامل

مسائل اصطلاحنامه سازی در ایران از دیدگاه تهیه کنندگان اصطلاحنامه

Introduction: The present research attempts to study the theoretical foundations of thesaurus construction before and after internet and identify the problems of thesaurus construction in Iran from the point of view of thesaurus makers and translators of the published thesauri.. Methods: The research population was 6 thesaurus makers (AbdolHossein Azaragn, Abbas Hori, Fatemeh Rahadoost, Faribor...

متن کامل

Exploring the use of concept spaces to improve medical information retrieval

This research investigated the application of techniques successfully used in previous information retrieval research, to the more challenging area of medical informatics. It was performed on a biomedical document collection testbed, Ž . CANCERLIT, provided by the National Cancer Institute NCI , which contains information on all types of cancer therapy. The quality or usefulness of terms sugges...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of the American Medical Informatics Association : JAMIA

دوره 19 5  شماره 

صفحات  -

تاریخ انتشار 2012